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Method and System for Acoustic Shock Protection 

FIELD OF INVENTION: 

5 This invention relates to signal processing technology, and more particularly, 

to a method and system for acoustic shock protection. 

BACKGROUND OF THE INVENTION: 

10 Unwanted sounds, such as loud sounds or sounds that have a rapid relative 

increase in level, may be produced by telephone or radio systems, intentionally or 
unintentionally. Those sounds are typically experienced by a user through 
headphones or a headset. 

Without protection against those sounds, the user may experience a 

15 phenomenon known as acoustic shock. Acoustic shock may result in permanent 
hearing loss, temporary hearing loss and tinnitus (constant ringing in the ears). 
Sufferers also report symptoms including extreme pain, vertigo and burning 
sensations. One of the main issues in the cause of acoustic shock syndrome, apart 
from the initial high-level sound, is the startle reflex action. This reaction can cause 

20 numerous muscles to activate to an unusual degree. 

In order to prevent the user from experiencing acoustic shock, the following 
approaches are provided in telephone systems: 1- Automatic gain control; 2- 
Adjustable notch filters to remove narrow band tones or "shrieks" when they are 
detected; 3- Clipping of high level signals using diodes or similar devices: 

25 These approaches have also been combined. All of these approaches use 

techniques that are well known in the art and have been seen in other application 
areas (e.g. hearing aids). 

The existing devices offer some protection. However, the processed output 
signal of those devices has reduced fidelity compared to an input signal. Typical 

30 distortions of the signal include: "pumping" (unnecessary and audible adjustments of 
the gain that adversely affect the perceived quality of the processed speech signal) 
and "holes" (audio dropouts in the processed signal caused by extreme gain 
adjustments), harmonic distortion as well as the accompanying intermodulation 
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distortion that comes from poor gain control. More complex systems may also suffer 
from excessive input-output latency (i.e., group delay), which can adversely impact 
network, acoustic and line echo cancellers. 

Currently there are specifications under development that provide guidelines 
5 and recommendations for the performance of acoustic shock systems. These 
include: ITU-T Recommendation P.360 "Efficiency of devices from preventing the 
occurrence of acoustic pressure by telephone receivers"; UK standard BS6317 
specified continuous signals only; US standard, UL1950; EN60950 
(http://www.ktl.com/telecoms/shock.htm). 
10 To meet these emerging performance requirements and deliver high-fidelity 

with low group delay, a new approach is needed. It is, therefore, desirable to provide 
a new method and system that can fully protect a user against the acoustic shock so 
as to meet the above guidelines and future guidelines that may emerge. 

15 SUMMARY OF THE INVENTION: 

It is an object of the invention to provide a novel method and device that 
obviates or mitigates at least one of the disadvantages of existing systems. 

In accordance with an aspect of the present invention, there is provided a 

20 method of providing protection against acoustic shock, which includes the steps of: 
performing a pattern analysis on an input signal to identify a parameter space 
corresponding to a signal space of the input signal; applying a rule-based decision to 
the parameter space to detect an acoustic shock event; and removing the acoustic 
shock event from the input signal to generate a processed output signal. 

25 In accordance with a further aspect of the present invention, there is provided 

a method of providing protection against acoustic shock, which includes the steps of: 
performing a weighted overlap-add (WOLA) analysis on an input signal; performing 
feature extraction on the input signal and performing feature extraction on the 
plurality of band signals provided by the WOLA analysis; detecting an acoustic shock 

30 event based on feature extractions from the input signal and band signals; performing 
gain control based on the acoustic shock event detection; applying a plurality of 
calibrated gains to the band signals to reduce the level to meet a predetermined safe 



level; and performing a WOLA synthesis on the modified band signals (modified by 
the application of gain) to synthesize an output signal. 

In accordance with a further aspect of the present invention, there is provided 
a system for providing protection against acoustic shock, which includes: an analysis 

5 module for performing a pattern analysis on an input signal to identify a parameter 
space corresponding to a signal space of the input signal; a detection module for 
applying a rule-based decision to the parameter space to detect an acoustic shock 
event; and a removal module for removing the acoustic shock event. 

In accordance with a further aspect of the present invention, there is provided 

10 a system for providing protection against acoustic shock, which includes: a weighted 
overlap add (WOLA) analysis module for transforming an input signal to a plurality of 
band signals; a feature extraction module for performing feature extraction on the 
input signal and for performing feature extraction on the band signals; a detection 
module for detecting an acoustic shock event based on the feature extractions from 

15 the input signal and the band signals; a gain control module for performing gain 
control based on the shock detection and the extracted features of the input signal 
and the band signals; and a calibration module for applying calibrated gains to the 
band signals to reduce the level to meet a predetermined safe level; and a WOLA 
synthesis module for synthesizing the modified band signals to provide an output 

20 signal. 

In accordance with a further aspect of the present invention, there is provided 
a system for providing protection against an acoustic shock, which includes: a 
weighted overlap-add (WOLA) analysis module for transforming an input signal into 
a plurality of oversampled sub-band signals in a frequency domain; a processing 
25 module for adaptively processing the sub-band signals to remove an acoustic shock 
event: and a WOLA synthesis module for synthesizing the processed sub-band 
signals to provide an output signal. 

In accordance with a further aspect of the present invention, there is provided 
an acoustic shock protection method and system, in which a weighted overlap-add 
30 (WOLA) filterbank is used and a delay unit is used prior to WOLA analysis enabling 
predictive features to be fed around the WOLA filterbank analysis stage. 

Here, the inherent delay of the WOLA analysis filterbank processing is 
advantageously used, reducing the necessary amount of added delay while 
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maintaining a low overall group delay. The low group delay of the WOLA filterbank is 
a considerable advantage. 

Other aspects and features of the present invention will be readily apparent to 
those skilled in the art from a review of the following detailed description of preferred 
5 embodiments in conjunction with the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS: 

The invention will be further understood from the following description with 
10 reference to the drawings in which: 

Figure 1 is a schematic diagram showing an acoustic shock protection system 
in accordance with an embodiment of the present invention; 

Figure 2 is a schematic diagram showing the process of Figure 1 ; 
15 Figure 3 is a schematic diagram showing one example of the acoustic shock 

protection system of Figure 1 , which has a WOLA filterbank; 

Figure 4 is a schematic diagram showing one example of the WOLA based 
architecture of Figure 3; 

Figure 5 is a schematic diagram showing one example of the shock state 
20 machine of Figure 4; 

Figure 6 is a schematic diagram showing one example of an input-output 
function of the gain control of Figure 4; 

Figure 7 is a flow chart showing one example of implementing gain calculation 
at the gain calculation block of Figure 4; 
25 Figure 8 is a block diagram showing a sub-band adaptive periodic noise 

cancellation technique in accordance with another embodiment of the present 
invention; and 

Figure 9 is a schematic diagram showing one example of the sub-band 
periodic noise cancellation block of Figure 8. 

30 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS: 



Figure 1 shows an acoustic shock protection system 2 in accordance with an 



embodiment of the present invention. The acoustic shock protection system 2 
includes an analysis block 4, a detection block 6, a removal block 8 and a logging 
block 10. The system 2 further includes a calibration block 12 that contains data used 
to calibrate the input and output levels and to support user preferences (e.g. support 

5 for personalization). The arrows in Figure 1 show how these major blocks 
interconnect and interact with each other. 

The acoustic shock protection system 2 implements a pattern 
analysis-based approach. The approach employs a collection of input signals, 
referred to as a "signal space", which contains a representative subset of acoustic 

10 shock and non-acoustic shock signals that the system will process and, in the case 
of input signals that contain an acoustic shock event, protect against in typical 
operation. This signal space is processed via a feature extraction process at the 
analysis block 4 to realize a reduced dataset representation of the signal space. 
The collection of reduced dataset representations, one for each of the signals in the 

15 signal space, is referred to as a parameter space. 

The feature extraction process includes one or more measurements. The 
pattern analysis-based approach uses the measurements to determine if an acoustic 
shock is present. The acoustic shocks may include high level, short duration (e.g. a 
loud sound); level versus time exposure; rapid increase in relative level exposure 

20 and/or combination thereof. 

The analysis block 4 may use, for example, artificial neural networks, hidden 
Markov models, Bayesian estimation methods, signal statistics or combinations of 
signal parameters such as short-term and long-term level over various frequency 
ranges and time-scales. In some situations, it may be necessary to measure and 

25 protect against level versus time exposure to signal levels. In combination with the 
other subcomponents of the analysis block 4, a timer component 14 in the analysis 
block 4 provides the capability to protect against level-over-time exposure. 

The detection block 6 contains a logical description in terms of inference rules 
representing patterns of analysis output conditions representing acoustic shock 

30 events. The detection block 6, for example, may take the form of a rule-based expert 
system. 

The removal block 8 implements the signal processing necessary to remove 
the shock event from the input signal, typically by the application of time, level or 



frequency-dependent gain or a combination thereof, while maintaining the fidelity of 

the output signal, relative to the input signal. 

The logging block 10 implements on-line data collection of the key parameters 

of shock events. This information is time-stamped using the timer 14 or another 
5 similar means and may be categorized to provide accurate data records for later 

downloading and analysis. 

The calibration block 12 provides data that is required to account for the 

acoustic and electrical differences between various headphones, headsets, systems 

and other components. The data from the block 12 ensures that the output level 
10 experienced by the user of the system 2 does not exceed a preset maximum as a 

function of time and frequency. The preset maximum may be changeable for each 

user. The logging block 10 and the calibration block 12 may be detachably attached 

to the system 2. 

In Figure 1, the analysis block 4, the detection block 6 and the removal block 
15 8 are shown separately. However, the component of the analysis block 4 may be 
used as the component of the detection block 6 or the removal block 8. The 
component of the detection block 6 may be used as the component of the removal 
block 8. Processing done in each of the blocks may be reused in other blocks to 
minimize the computational and memory requirements for implementing the 
20 invention. 

For example, the system 2 may re-use some of the features, which are used 
during analysis, as information (e.g. level measurements) to perform gain control in 
the removal block 8. The measurements used to perform analysis and detection at 
the analysis block 4 and the detection block 6 are reused to perform the signal 
25 processing necessary for acoustic shock removal. Similarly, these measurements 
can be used as a portion of the data collected by the logging block 10. 

The pattern analysis-based approach in the acoustic shock protection system 
2 also provides the flexibility to effectively deal with a wide range of input signal types. 
This identification mechanism/method effectively makes use of various signal 
30 parameters to identify acoustic shock events. 

A feedforward configuration is used to provide fast-acting and intelligent gain 
control. In a preferred embodiment, the gain applied to the processed signal is 
controlled by a set of gain control rules that receive, as input, the output(s) from the 



pattern analysis. While gain control is highly useful in removing acoustic shock when 
only a single signal is available, the addition of adaptive filtering provides further 
performance for periodic signal disturbance sources or in applications where a 
secondary input is available. The gain application and adaptive filtering blocks are 

5 seen as complementary. The gain control block acts quickly to remove acoustic 
shock transients. In cases where the shock is periodic noise or predictable from a 
secondary input, the adaptive filtering block converges to remove this noise. The 
details of the adaptive filtering block will be presented later; for now, the discussion 
will assume only a single input is available. 

10 The entire implementation is parameterizable. This allows the flexibility 

needed to deal with a wide range of environments and acoustic configurations. 

Figure 2 is a schematic diagram showing the signal processing for the system 
in Figure 1 . Referring to Figures 1 and 2, a signal space 20 is mapped to a parameter 
space 22 defined by [X1 , X2, X3, X4, X5..], by performing the feature extraction at the 

15 analysis block 4 of Figure 1 . The detection block 6 makes a rule-based decision 
(rules 24) regarding the presence or absence of an acoustic shock event using the 
parameters. Then, the removal block 8 of Figure 1 performs gain control 26 to 
remove the acoustic shock event. 

This approach allows the system 2 to be flexible and trainable. The signal 

20 space 20 can always be extended to account for a signal class or type for which 
protection must be provided. The rules 24 used for the gain control 26 can be 
adjusted to meet new requirements. Preferably, the rules 24 are implemented and 
adjusted in software or firmware. 

In one embodiment, an oversampled filterbank, such as a weighted 

25 overlap-add (WOLA) filterbank, is applicable to the pattern analysis-based approach 
of acoustic shock detection. The WOLA filterbank is described in United States 
Patent No. 6,236,731, "Filterbank Structure and Method for Filtering and Separating 
an Information Signal into Different Bands, Particularly for Audio Signal in Hearing 
Aids" by R. Brennan and T. Schneider, issued on May 22, 2001, and United States 

30 Patent No. 6,240, 1 92, "Apparatus for and method of filtering in an digital hearing aid, 
including an application specific integrated circuit and a programmable digital signal 
processor" by R. Brennan and T. Schneider, issued on May 29, 2001 , which are 
incorporated herein by reference. 



The WOLA filterbank includes a WOLA analysis filterbank and a WOLA 
synthesis filterbank. The WOLA analysis filterbank receives a plurality of information 
signals in the time domain and transforms the information signals into a plurality of 
band signals in the transform (frequency) domain. The WOLA synthesis filterbank 

5 receives a plurality of band signals in the transform (frequency) domain and 

transforms the band signals into a single information signal in the time domain. A data 
processor for processing the output of the WOLA analysis filter bank may be provided 
between the WOLA analysis filterbank and the WOLA synthesis filterbank. 

Figure 3 shows one example of the acoustic shock protection system 2 of 

10 Figure 1 . The acoustic shock protection system 2 of Figure 3 includes a feature 
extraction block 44, a shock detection block 46, a gain control logic block 48, a WOLA 
analysis filterbank 50, a WOLA synthesis 52 and a complex multiplier 54. The WOLA 
analysis filterbank 50 and the feature extraction block 44 correspond to the analysis 
block 4 of Figure 1 . The shock detection block 46 corresponds to the detection block 

15 6 of Figure 1 . The gain control logic block 48 corresponds to the removal block 8 of 
Figure 1 . 

The feature extraction block 44 performs level measurements. For example, 
the level measurements include fast level measurement, slow level measurement, 
gradient measurement, broadband measurement in the time-domain and 
20 narrowband measurement in the filterbank bands or groups of bands. The gain 
control logic block 48 selects appropriate level measurements and performs gain 
calculations. 

The WOLA analysis filterbank 50 of the WOLA filterbank is used to divide the 
input signal into a plurality of frequency bands. These bands and the raw input signal 
25 are then processed individually or in combination to extract relevant signal features in 
the feature extraction block 44. This combination of processing steps is implemented 
at the analysis block 4 of Figure 1 . The results of the feature extraction are then 
passed to the shock detection block 46. The shock detection block 46 makes a 
, rule-based decision about the presence or absence of an acoustic shock event. The 
30 shock detection block 46 performs the role of the detection block 6 of Figure 1 . The 
shock detection decision is then used in conjunction with the extracted features to 
perform intelligent gain control at the gain control logic block 48. 

The gain control logic block 48 has inputs from both the feature extraction 44 



and the shock detection block 46. For example, the gain control logic block 48 may 
implement gain control logic using static gain curves in each sub-band or by applying 
a gain calculated from the raw input signal level, or by a combination of the two. The 
application of the gain to these bands provides the control required to implement 

5 acoustic shock protection. The complex-multiplier 54 (used in real mode in this 
application) multiplies the output of the WOLA analysis filterbank 50 and the output 
of the gain control logic block 48. The WOLA synthesis block 52 of the WOLA 
filterbank receives the outputs of the multiplier 54 and transforms them into audio 
output in the time domain. 

10 The acoustic shock protection system with the WOLA filterbank provides high 

fidelity and low group delay. It also provides high adjacent band isolation that permits 
high-fidelity removal of frequency isolated shock events, such as tones or other 
narrow band signals. This implementation can be advantageously realized on the 
system architecture disclosed in U.S. Patent No. 6,236,731 and U.S. Patent No. 

15 6,240,192. 

Figure 4 shows one example of the feature extraction block 44, the shock 
detection block 46 and the gain control logic block 48 of Figure 3. 

The feature extraction block 44 of Figure 4 employs time-domain 
(pre-analysis) and frequency-domain (post-analysis) measurements. The feature 
20 extraction block 44 of Figure 4 includes fast and slow time-domain broadband level 
measurements and a vector of fast and slow narrowband level measurements 
corresponding to each frequency band. 

The time domain measurements are now described in detail. In Figure 4, a 
fast exponential average block ("Fast Exp. Avg.") 60, a slow exponential average 
25 block ("Slow Exp Avg.") 62 and a summer 64 implements the time domain 
measurements. 

The time domain measurements can be first-order fast and slow exponential 
averages of the RMS (root mean square) signal level that are well-known in the art. 
However, other level measurement techniques, such as short or long term minimum 
30 and maximum signal levels, order statistics (e.g., median), outputs from distribution 
measurements (e.g., the level that represents the 95 th percentile) or other similar 
measurements well known in the art may be applicable. 

The difference between these levels is a feature, which is sent to the shock 
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decision block 74. Furthermore, these levels are calculated ahead of the WOLA 
processed signal using a delay block 72 ( u z~ n ° " in Figure 4) that provides a time delay 
corresponding to n Q samples. This delay permits the time-domain measurements to 

indicate the presence of a high level (and a potential acoustic shock event) to the 

5 shock detection rules before the shock appears in the plurality of outputs from the 
analysis operation. A fundamental property at the onset of any large acoustic 
transient or shock is that it is initially indistinguishable from a broadband disturbance. 
Only later, as the shock progresses through the analysis filterbank, is further 
classification possible indicating the presence of narrowband/tonal noise or 

10 broadband noise. A rapid increase in time-domain energy, therefore, indicates that 
some form of shock condition is initiating and signals that a sensitization of the 
frequency domain detectors located in the frequency bands after the analysis stage 
should occur. This sensitization allows the gain logic to preemptively adapt 
eliminating a possible, brief high-level transient at the onset of a high level, because 

15 the gain control has already "seen" the high-level and had, if necessary, reduced the 
relevant gain or gains accordingly. The WOLA analysis filterbank 50 processes the 
output of the delay block 72. This parameterizable feed-forward delay allows the 
measurements to indicate the presence of shock one or more processing blocks 
before the shock would be (if unmodified) experienced by the end user, thereby 

20 improving protection. The inherent group delay of the WOLA analysis is used 
advantageously to reduce the delay time n 0 implemented by the delay unit 72, in 
some instantances permitting the delay no to be reduced to zero with the WOLA 
analysis used to provide all of the required delay. 

The frequency domain measurements are now described in detail. In Figure 

25 4, a fast exponential average block ("Fast Exp. Avg.") 66, a slow exponential average 
block ("Slow Exp Avg.") 68 and a summer 70 implement the frequency domain 
measurements. 

The frequency domain measurements can also be fast and slow exponential 
averages of the RMS signal level in each frequency band (as are well-known in the 
30 art). Other level measurement techniques, such as short or long term minimum and 
maximum signal levels, order statistics (e.g., median), outputs from distribution 
measurements (e.g., the level that represents the 95 th percentile) or other similar 
measurements well known in the art are also possible. The difference or other 
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comparative measurement between these levels is a feature, which is sent to the 
shock decision block 74. 

The blocks 60 and 62 may act as a dual time-constant detector, which have 
averaging filters and detect current signal levels in accordance with the following 
equations (1) to (3): 



L(/)=201og 



CALIB LEVEL 



(1) 



Lfast{l) = a/astL(i) + (1 - a/ast)Lfast(i - 1) (2) 

Lsiow(i) = cisiowL(i) + (1 - cisiow )Lsbw(i - 1) (3) 

10 Where L(i) indicates the level measurements of the i th block; 

a is a time constant determined from the desired filter time constants; 
R is the block size used for WOLA analysis; 
Xj are the block samples; 

CALIB_LEVEL is a calibration level chosen during configuration; 
15 the subscripts of "fast" and "slow" indicate the fast and slow filters respectively. 



This detector computes exponentially averaged level measurements using a 
fast time constant and a slow time constant. The dual time-constant detector 
extracts current signal levels through the two measurements and detects the 

20 presence of relatively loud transients through the difference between the two 
measurements. The detector operates in dB levels but may also operate on 
non-logarithmic power measurements. Using dB levels, the difference is the ratio of 
the power measurements in each filter. 

Similarly, the blocks 66 and 68 may act as a dual time-constant detector for 

25 each sub-band, which detect current signal levels in accordance with the following 
equations (4) to (6): 

w 



Iy(0= 20 log 



(4) 



\ CALIB _ LEVEL) 
If (/) = af Lj(i)+ (1- af )Lf (/- 1) (5) 
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Ufij) = a^Ljii)* (1- a^L^ii- 1) (6) 

Where L/0 indicates the level measurements of the /-th block in the y-th band; 
Xj is the current complex measurement in the y-th band; 
a is a time constant for that band's filter; 

the superscripts of "fast" and "slow" indicate the fast and slow filters 
respectively. 

These detectors provide information about the presence of narrowband shock 
conditions to the shock detection rules. These detectors operate in dB levels but may 
also operate on non-logarithmic power measurements. Using dB levels, the 
difference is the ratio of the power measurements in each filter. 

The shock detection and gain control logic 80 of Figure 4 is now described in 
detail. The shock detection and gain control logic 80 includes the shock decision 
block 74, a shock state machine 76 and a gain control block 78. The shock detection 
and gain control logic 80 corresponds to the shock detection block 46 and gain 
control logic 48 of Figure 3. 

The shock decision block 74 of Figure 4 makes a decision about the presence 
of acoustic shock at both the broadband and narrowband levels. A large value in 
either the time-domain feature or the band-level features indicates a quick increase in 
relative sound exposure level for the end user. The thresholds may be set by 
experimentation and heuristics. Alternatively, these could be set using automatic 
training methodologies such as artificial neural networks. 

The decision at the shock decision block 74 is then used by a shock state 
machine 76, which uses a rule-base to decide whether the current exposure level is 
a potentially harmful acoustic shock event. The shock state machine 76 coupled with 
the shock decision block 74 corresponds to the detection block 6 of Figure 1 . 

For example, the shock detection block 46 outputs a flag, such as yes/no flag, 
for overall shock and a flag, such as yes/no flag, for shock within each sub-band. The 
overall shock flag is used internally for the shock detection. The sub-band shock flag 
is used for gain calculation. Figure 5 shows one example of the shock state machine 
76 of Figure 4. The shock state machine 76 of Figure 5 contains a overall shock state 
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machine 110 and a band shock state machine 112. The overall shock state machine 
110 examines a difference between the time-domain level measurements to 
determine a current condition, i.e., "no shock" state 1 14, "shock" state 1 16 or "hold 
shock" state 118. The "hold shock" state compensates for the feedforward delay 72 
5 so that an overall shock condition is not prematurely terminated before the input 
signal has reached the plurality of output bands from WOLA analysis block 50. The 
band shock state machine 112 examines a difference between the sub-band level 
measurements to determine a current condition, i.e., "no shock" state 120 or "shock" 
state 122. 

10 In Figure 5, Arms represents a parameterizable threshold in time-domain, and 

Aband represents a parameterizable threshold in sub-band domain. The general 
condition for shock is either a time-domain level difference greater than the threshold 
Arms, or a sub-band level difference greater than the threshold A ba nd. 

Shock is also automatically asserted when the fast level measurements 

15 exceed specified maximum level thresholds labeled L max for the time-domain level 
and Uand-maxfor the sub-band level measurements. Shock is exited only when the 
maximum level threshold is no longer exceeded and the measurement difference has 
dropped back to zero or less. The maximum thresholds deal with the case of 
absolute shock, where the level is higher than the tolerated output. The difference 

20 thresholds deal with the case of relative shock, where the average level of the signal 
has increased drastically in a short period of time and could be shocking. In addition 
to these level based transition guards, there is also a hold time T h0 id and the hold 
shock state 1 18 associated with the overall shock state. This is to compensate for the 
time domain measurement "look-ahead" caused by the digital delay. The hold time 

25 Thoid is configurable, and is greater than or equal to the delay parameter (n 0 ) divided 
by the WOLA analysis block size (R). 

Referring to Figure 4, the acoustic shock state, i.e. whether acoustic shock is 
currently occurring or not, is then used by the gain control logic at gain calculation 
block 78 of Figure 4. The gain calculation block 78 corresponds to the removal block 

30 8 of Figure 1 . 

The output of the shock decision can be used in the logging portion (e.g. block 
10 of Figure 1). 

The detector 6 of Figure 1 , 46 of Figure 3 and 74/76 of Figure 4 may use fuzzy 
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logic, neural networks and/or combinations of overall-level and band-level control. 
Preferably, the frequency-domain processing is implemented with delay 72 of Figure 
4 before the WOLA analysis filterbank 50, with the inherent WOLA delay being 
sufficient to encompass the entire delay time needed in some applications, to obtain 

5 look-ahead functionality. 

In one embodiment, the gain calculation block 78 employs a gain control using 
static input-output (IO) curves for each sub-band. By using a separate IO curve for 
each sub-band, frequency dependent level limiting can be achieved. The 
input-output behavior is implemented internally by the algorithm as an input-gain 

10 function. Figure 6 shows one example of the IO function of the gain control. The 
entire curve is specified by the expansion region slop S1, the compression region 
slop S2, the limiting region slop S3 and the co-ordinates of the two knee-points, which 
implicitly sets the compression ratio. Calibration for input and output analog 
responses can be taken into account by adjusting the gain applied accordingly so that 

15 the desired IO curve is achieved. 

The choice of what input level to use during the gain control is controlled by the 
output of the shock state machine 76. During a "shock" condition, peak 
instantaneous levels are tracked and used for the gain calculations. Using the 
instantaneous levels ensures that the shock is suppressed immediately and tracking 

20 a peak ensures that the gain is steady during an instance of acoustic shock, thereby 
improving fidelity. 

Output gain is reduced in bands where acoustic shock is detected in order to 
protect the user. The fast and slow level measurements from the feature extraction 
block, combined with instantaneous levels are used to compute the desired gain. 

25 Confining the output attenuation to only the bands where shock is prevalent 
minimizes any distortion of the output and thus provides improved fidelity. 

These gain control details, such as maximum output level, and shock decision 
thresholds constitute the parameters that are available for personalization and 
calibration (e.g. block 12 of Figure 1). 

30 In another embodiment, the gain calculation block 78 performs gain 

calculation as shown in Figure 7. Figure 7 shows one example of the gain calculation 
at the gain calculation block 78. In step S2, the time domain shock flag F t (i) for the 
given i m input block is determined based on the measurements of Lfast (i) and L s i 0 w (i). 
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In step S4, a broadband gain G b b(i) for the i m block (i.e. block i) is calculated in 
response to the time domain shock flag F t (i). In step S6, a narrowband shock flag 
Ff(j,i) for band j, block i is evaluated. In step S8, a narrowband gain G nb (j,i) for the 
band j, block i is calculated in response to the narrowband shock flag F f (j,i). In step 

5 S10, a gain weighting ratio r(i) for the block i is calculated. In step S12, a final gain 
GfinaiG. 0 is calculated using the gain weighting ratio r(i). 

The calculation of the broadband gain G b b(i) is described in detail. At the 
beginning of the shock, which is detected in the time domain (F t (i)=1), a broadband 
counter n b b is set to count down the number of frames which are affected only by the 

10 broadband gain for reducing the transient effect. This number can be a fixed value, 
or is configurable as an adaptive value. The broadband gain G bb (i) is calculated 
based on the time domain shock flag F t (i) and the broadband counter n bb in 
accordance with the equation (7): 
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G bb (o = o 



ifF ( (i) = Q 
f(F t (i) = \A 
if(F t (i) = \An bb (i)<0) 



if(F t (i) = \An bb (i)>Q) 



(7) 



where L ins t(i) and L avg (i) are the instantaneous and fast average energy levels of the 
input block, respectively; and T bb is a broadband threshold, which is configurable. A 
user can determine the threshold value T bb by calibrating the system for limiting the 

20 output to certain dB SPL (Sound Pressure Level) value. 

Initially the time domain shock flag F t (i) is checked to determine the 
occurrence of a shock. If F t (i) is zero, i.e. no shock, the broadband gain G b (i) is set to 
0 dB, to not affect the incoming signal in normal situations. If F t (i) is equal to one, the 
signal is carrying a shock and the broadband gain has to be calculated as the 

25 difference between the energy level and the time domain threshold value. At this 
point, the energy level has to be chosen based on the position of the processed block 
relative to the beginning of the shock. This is due to the transient reduction function. 
The transient reduction function applies only the broadband gain at the beginning of 
the transient time when the shock is detected only in time domain. Therefore, to 

30 compress the shock fully during the transient time, it is required to use instantaneous 
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energy level for the given block. Since the broadband counter n b b represents the 
count down for the duration of the transient effect, the instantaneous energy level 
Lavg(i) is used for the blocks with n b b greater than zero, otherwise, for minimizing the 
fluctuation on the gain values, the average energy level L avg (i) is used to calculated 
5 the gain. 

The narrowband shock flag Ff(j,i) is detected using the shock state machine as 
described in Figure 5. The computation of the narrowband gain G n b(j,i) is described 
in detail. The narrowband gain G n bGJ) is calculated based on the status of the 
frequency domain shock flag Ff(j,i) in accordance with the equation (8) where j is the 

10 band index and i is the input block number. If there is no shock detected in the band 
i (Ff(j,i)=0), the narrowband gain G nb (j>i) is set to OdB, otherwise, the gain G„bG j) is 
calculated as the difference between the energy level of the band and its 
corresponding threshold value T n bG). At this point, similar to the broadband gain, the 
instantaneous energy level Lj ns t(j» i) is used to calculate the gain when the input block 

15 is during the transient time (i.e. n b b is greater than zero), and the average energy level 
Lavg(i) is applied if the input block is after the transient time (i.e. n b b is equal to zero). 
This ensures that the shock is fully compensated during the transient time and the 
gain variations are minimized during the shock and after the transient time: 
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0**0.0-0 ifF f (j 9 i) = 0 

G nbW = W 7 ' 0 " T nb W ViFfUA = 1 a n bb [i) < 0) 



The calculation of the gain weighting ratio r(i) is now described in detail. The 
gain weighting ratio r(i) is a key parameter for calculating the appropriate gain value 
for each band. It defines the amount of contribution made by the broadband gain and 
25 narrowband gain to the final gain value, based on the time domain and frequency 
domain shock states. The equation (9) represents the conditions used to calculate 
the gain weighting ratio r(i): 
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K0 = 1 i/{F,0) = 0} 

' KO = 0 if{F t (i) = 1 a (3F / (l : AT,/) = 0)} (9) 

KO = 10 [Lmax(,) " (0 = 1 a (3Fy(l : = 1)} 

where N is the number of the bands; and 

L total (i) is the sum of the energy levels of all the bands evaluated for the input 

5 blOCk i: L totaJ (i)=I10 L inst (n ' i ) 
n=l 

The value r(i) is calculated per sample block and it considers the overall 
frequency domain shock flag to be equal to one when at least one band is in shock. 

The value r(i) is set to one during the normal conditions where no shock is 
detected (i.e. F t (i) = 0 and VFK1 :N,i) = 0) and also at the end of a shock (i.e. F t (i) = 0 

10 and VFfO:N,i) = 1). This is due to the fact that by setting the gain ratio to one during 
the normal conditions, only the narrowband gain is enabled and by changing the gain 
values, the frequency response of the system can be modified. Also, at the end of a 
shock, because of the "inherited blocks" from the shock period in the analysis 
window, the frequency bands carry high energy levels while no time domain shock 

15 has been detected. This results in an overshoot in the synthesis output if no gain is 
applied. This effect can be managed by enabling the narrowband gain and using the 
average energy band levels L avg GJ) to calculate the gain values. 

During the transient time, initially, the shock is detected only in the time 
domain (i.e. F t (i) = 1 and VF f (1:N,i) = 0) and as a result only the broadband gain can 

20 be used to compress the shock. Therefore for full shock compensation, the gain ratio 
is set to zero. The narrowband gain is brought to the picture automatically, as soon 
as the band levels reach the threshold values. 

During the steady state of the shock (just after the transient and before the 
end), the r(i) is calculated as the ratio of the maximum energy level carried by any of 

25 the bands in block i (L max (i) = Max(L ins t(1 :N,i))), over the total energy carried by all the 
bands in corresponding block, L to tai(")- By having this value as the gain ratio, the 
concentration of energy over the spectrum is calculated and the energy of a band is 
reduced based on its contribution to the total energy level. In spite of the fact that this 
values is the same for all the bands, the final gains are still customized for the bands 
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in shock based on their narrowband gains. 

The calculation of the final gain Gfj na i(jJ) is now described in detail. The final 
gain for each band j, is calculated in accordance with the equation (10): 



According to an aspect of the present invention, processing in a plurality of 
frequency domain bands provides the flexibility needed to effectively protect against 
wideband (in frequency) and narrowband shock signals. For narrowband signals (e.g. 
tones or dual-tones), band-based gain control and/or adaptive processing provides 
localized gain control that results in high fidelity because gain adjustments are only 
made in the frequency ranges where they are necessary. 

Periodic acoustic shock signals can be virtually eliminated using sub-band 
implementations of Least Mean Squares (LMS) periodic interference cancellation 
techniques as described in, for example, B. Widrow and S. D. Stearns, "Adaptive 
signal processing." Englewood Cliffs, NJ: Prentice-Hall, 1985., which is incorporated 
herein by reference. These may be implemented at the block 8 of Figure 1 in 
subband outputs from the WOLA analysis as described below. 

Sub-band adaptive processing disclosed in Canadian Patent Application No. 
2,397,080, entitled by "Sub-band Adaptive Signal Processing In An Oversampled 
Filterbank" and Canadian Patent Application No. 2,437,477, entitled by "Method and 
System for Processing Subband Signals Using Adaptive Filters", which are 
incorporated by reference herewith, is applicable to the sub-band processing of the 
present invention. 

Figure 8 shows an acoustic shock protection system 90 in accordance with 
another embodiment of the present invention. The audio signal contains the desired 
speech and acoustic shock event that is assumed to be a periodic noise source. The 
acoustic shock protection system 90 employs sub-band adaptive periodic noise 
cancellation technique and acts as a periodic noise canceller. 

The system 90 of Figure 8 includes the WOLA analysis filterbank 50, the 
WOLA synthesis filterbank 52 and a processing block 92. The processing block 92 




(10) 
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includes a plurality of sub-band periodic noise cancellation blocks 94A-94N, which 
corresponds to N sub-bands. The WOLA analysis interbank 50 transforms audio 
input into a plurality of sub-bands. Each sub-band is then passed to the 
corresponding periodic noise cancellation block. The WOLA synthesis filterbank 52 
5 combines the outputs of the processing block 92 into a single signal. 

The system of Figure 2 may further contain the sub-band periodic noise 
cancellation block of Figure 8 to improve acoustic shock cancellation. As described 
above, the system of Figure 2 works quickly to remove the acoustical trauma while 
the sub-band periodic noise cancellation goes a step further to actually cancel the 

10 interference whenever a periodic acoustical shock is present. 

Figure 9 shows one example of the sub-band periodic noise cancellation block 
94 of Figure 8. The sub-band periodic noise cancellation block 94 includes a delay 
block 96, an adaptive filter 98 and a summing block 100. 

A sub-band input is delayed at the delay block 96. The delayed version of the 

15 audio sub-band input is filtered by the adaptive filter 98 and then subtracted from the 
original sub-band input by the block 1 00. The resulting error signal from the block 1 00 
acts as the sub-band output. The sub-band output is also applied to the adaptive 
filter 98 to update the adaptive filter 98 using the LMS algorithm or other similar 
techniques. The adaptive filter 98 will cancel the periodic noise because it is 

20 correlated with a time-delayed version of itself whereas the desired speech will 
remain because it is not similarly correlated with the delayed input. 

In practice, it may be desirable to only use the filtered output of the summing 
block 100 when strong periodicity is detected. When weak or no periodicity is 
detected, the sub-band output is made equal to the sub-band input to avoid 

25 degradation of the signal. This periodicity detector may be made by allowing the 
sub-band filtering block to always operate while observing the variance of the filter 
coefficients. During regions where the variance is high, above a settable threshold, 
periodicity would be declared, activating a switch to the filtered output. Conversely, 
where the variance is low, below a settable threshold which may be different from the 

30 previous threshold to implement hysteresis, periodicity would be declared absent and 
switching the output to use the unfiltered input. Other versions are envisaged as well, 
where the periodicity detection is done with fuzzy logic and the output is a weighted 
sum of the filtered and unfiltered inputs according to the certainty of the periodicity 
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detection. 

The embodiments of the acoustic shock protection system act as an intelligent 
limiter, which quickly and accurately recognizes acoustic shock conditions and 
quickly eliminates the acoustic shock conditions. It can provide broadband and 
5 narrowband attenuation of the received signal. The system may limit the output 
levels from the headset to a specified maximum that is frequency dependent. The 
system may provide attenuation of sounds that is below the maximum outputs, but 
are significantly above the current average sound levels. 

The pattern analysis-based approach of the present invention provides high 
10 fidelity and low group delay in the processed output signal relative to existing devices. 

According to a further aspect of the embodiment of the present invention, an 
oversampled filterbank (e.g. WOLA filterbank) provides low-group delay and the 
ability to make large gain adjustments without adversely impacting signal fidelity or 
adjacent band signals. These combine to offer a high level of protection and high 
15 fidelity. 

Further, calibration and personalization allow the system to be easily adjusted 
to work with a specific acoustic configuration (e.g. a specific headset or headphone). 
Built-in calibration software allows for simplified calibration in the field. 
Personalization allows some parameters of the invention to be adjusted by the user 

20 so that the performance and fidelity suit their specific preferences. 

The embodiments of the present invention provide the ability to log data and 
download it for later analysis. This can be done over a wired or wireless (radio 
frequency or other) link. For example, the acoustic shock protection system can be 
connected to an IP network (and to the Internet) or it can use existing RF 

25 technologies, such as 802.1 1 , Bluetooth or Zigbee. An addressing scheme allows 
the logging and analysis of data from multiple systems so that the invention can be 
used in an environment where a number of units need to have data logged and 
analyzed (e.g. a call center). 

The embodiments of the acoustic shock protection system are applicable to 

30 telephone systems (both mobile and land), mobile and land-based radio systems, 
audio protection systems and similar devices, such as headsets or headphones, that 
protect users from loud sounds. Similar areas of technology include audio limiters 
and dynamic range compressors. 
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The embodiments described above may be implemented by any hardware, 
software or a combination of hardware and software having the above described 
functions. The software code, either in its entirety or a part thereof, may be stored in 
a computer readable memory. Further, a computer data signal representing the 
5 software code, which may be embedded in a carrier wave may be transmitted via a 
communication network. Such a computer readable memory and a computer data 
signal are also within the scope of the present invention, as well as the hardware, 
software and the combination thereof. 

While particular embodiments of the present invention have been shown and 
10 described, changes and modifications may be made to such embodiments without 
departing from the true scope of the invention. 



